[Experimental] Add a RuntimeOption to set inter and intra op threadpool sizes #410
base: master
Conversation
@@ -93,7 +93,7 @@ void check_tf_status(const tensorflow::Status &status)
 }

 // Get TF session options given Neuropod RuntimeOptions
-tensorflow::SessionOptions get_tf_opts(const RuntimeOptions & /*unused*/)
+tensorflow::SessionOptions get_tf_opts(const RuntimeOptions &runtime_opts)
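A minimal sketch of what this mapping might look like. The struct definitions below are local stand-ins (the real `RuntimeOptions` lives in neuropod and the real config is TensorFlow's `SessionOptions`); only the field names are taken from the PR's diff.

```cpp
#include <cstdint>

// Hypothetical mirrors of neuropod::RuntimeOptions and the relevant part
// of tensorflow::SessionOptions, for illustration only.
struct RuntimeOptions
{
    uint32_t experimental_inter_op_parallelism_threads = 0;
    uint32_t experimental_intra_op_parallelism_threads = 0;
};

struct SessionConfig
{
    int32_t inter_op_parallelism_threads = 0;
    int32_t intra_op_parallelism_threads = 0;
};

// Copy the experimental thread-pool settings into the session config.
// A value of 0 is passed through, leaving the framework's default in place.
SessionConfig get_tf_opts(const RuntimeOptions &runtime_opts)
{
    SessionConfig config;
    config.inter_op_parallelism_threads =
        static_cast<int32_t>(runtime_opts.experimental_inter_op_parallelism_threads);
    config.intra_op_parallelism_threads =
        static_cast<int32_t>(runtime_opts.experimental_intra_op_parallelism_threads);
    return config;
}
```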
Not related to this change, just some thoughts: RuntimeOptions is now used in the C++, C, and Java APIs (and could be in Python too), so we need to keep them in sync. I have seen that TensorFlow uses a proto declaration in such cases and then generates structs for each language.
We could take a similar approach.
Makes sense - definitely something to look into.
I don't like TF's approach to options in their C API, though: they require a buffer containing a serialized proto as input, which makes it fairly complicated to set options directly from C.
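To illustrate the complaint: `TF_SetConfig` in TF's C API takes a serialized `ConfigProto` buffer, so a C caller has to produce protobuf wire format by hand. The field numbers below are assumed from TensorFlow's `config.proto` (`intra_op_parallelism_threads` = 2, `inter_op_parallelism_threads` = 5); this is a sketch of the encoding burden, not a production encoder.

```cpp
#include <cstdint>
#include <vector>

// Append one varint-typed protobuf field (wire type 0) to the buffer.
static void append_varint_field(std::vector<uint8_t> &buf, uint32_t field, uint32_t value)
{
    buf.push_back(static_cast<uint8_t>(field << 3)); // tag: field number, wire type 0
    while (value >= 0x80)
    {
        buf.push_back(static_cast<uint8_t>(value) | 0x80);
        value >>= 7;
    }
    buf.push_back(static_cast<uint8_t>(value));
}

// Hand-encode a minimal ConfigProto setting the two thread-pool fields.
// Field numbers are assumptions taken from tensorflow's config.proto.
std::vector<uint8_t> make_config_proto(uint32_t intra_threads, uint32_t inter_threads)
{
    std::vector<uint8_t> buf;
    if (intra_threads != 0) append_varint_field(buf, 2, intra_threads);
    if (inter_threads != 0) append_varint_field(buf, 5, inter_threads);
    // The result would then be passed to the C API, roughly:
    //   TF_SetConfig(session_opts, buf.data(), buf.size(), status);
    return buf;
}
```

Compare this with a plain options struct, where the same settings are two field assignments.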
Codecov Report
Attention: Patch coverage is
Additional details and impacted files

@@ Coverage Diff @@
##           master     #410      +/-   ##
==========================================
- Coverage   88.04%   87.83%   -0.22%
==========================================
  Files         106      106
  Lines        6893     6928      +35
==========================================
+ Hits         6069     6085      +16
- Misses        824      843      +19
// See https://pytorch.org/docs/stable/notes/cpu_threading_torchscript_inference.html#runtime-api
if (options.experimental_inter_op_parallelism_threads != 0)
{
    at::set_num_interop_threads(static_cast<int32_t>(options.experimental_inter_op_parallelism_threads));
As I see it, there is a torch::set_num_interop_threads that is supposed to be the "public" one. Minor, but I think it is still an issue.
Same for the other call below.
if (options.experimental_intra_op_parallelism_threads != 0)
{
    at::set_num_threads(static_cast<int32_t>(options.experimental_intra_op_parallelism_threads));
We need to give access to the get_num_* functions somehow. The user sets the runtime options (or not), then starts neuropod execution, and should be able to check which settings are actually in use. This is important for the "default" case, where the system picks the values, and also for the IPE case, where models share them.
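A hedged sketch of the getter being asked for: report the thread counts actually in effect, including the "default" case where 0 means "let the framework pick". The assumption that the framework default equals the hardware concurrency is illustrative only; in the real backend the values would come from the frameworks themselves (e.g. at::get_num_threads() / at::get_num_interop_threads() in libtorch).

```cpp
#include <algorithm>
#include <cstdint>
#include <thread>

// Effective (not requested) settings, after defaults have been resolved.
struct EffectiveThreadSettings
{
    uint32_t inter_op_threads;
    uint32_t intra_op_threads;
};

// Resolve requested values: 0 means "framework default", which we
// approximate here as the hardware concurrency for illustration.
EffectiveThreadSettings get_effective_settings(uint32_t requested_inter, uint32_t requested_intra)
{
    const uint32_t hw = std::max(1u, std::thread::hardware_concurrency());
    return EffectiveThreadSettings{
        requested_inter != 0 ? requested_inter : hw,
        requested_intra != 0 ? requested_intra : hw,
    };
}
```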
#if CAFFE2_NIGHTLY_VERSION >= 20190808
// Set intra and inter op parallelism
// See https://pytorch.org/docs/stable/notes/cpu_threading_torchscript_inference.html#runtime-api
if (options.experimental_inter_op_parallelism_threads != 0)
For the inter-op case, setting the value more than once is only allowed in the TBB build. So in our case, if a second IPE model arrives with a non-zero value, this will fail. As I see in the code, for non-TBB builds it sets an atomic variable, and I think the same could be done here, while still keeping the ability to change it in the TBB case. I am considering building libtorch with TBB; we have a TorchScript model/use case where TBB's "better" concurrency could be critical.
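A sketch of the "set once, tolerate repeats" behavior described above: the first caller wins, a second caller requesting the same value is a no-op, and a second caller requesting a different value is rejected instead of aborting the process. This mirrors (but does not use) the atomic guard in libtorch's non-TBB inter-op pool; the function name is hypothetical.

```cpp
#include <atomic>
#include <cstdint>

// -1 means "not set yet"; once set, the value is frozen for the process.
std::atomic<int32_t> g_inter_op_threads{-1};

// Returns true if `requested` is now the effective value.
bool try_set_inter_op_threads(int32_t requested)
{
    int32_t expected = -1;
    if (g_inter_op_threads.compare_exchange_strong(expected, requested))
    {
        return true; // first caller: value accepted
    }
    // Already set: a repeat request for the same value is harmless,
    // a conflicting request is rejected rather than throwing.
    return expected == requested;
}
```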
Summary:
Allow users to configure intra-op and inter-op thread pool sizes for the underlying frameworks.
Note: This API is experimental and may change in the future.
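Illustrative usage, assuming RuntimeOptions grows the two experimental fields from this PR. The struct here is a local stand-in, not neuropod's actual declaration, and the constructor call shown in the comment is approximate.

```cpp
#include <cstdint>

// Hypothetical mirror of the RuntimeOptions fields this PR adds.
struct RuntimeOptions
{
    uint32_t experimental_inter_op_parallelism_threads = 0; // 0 => framework default
    uint32_t experimental_intra_op_parallelism_threads = 0; // 0 => framework default
};

// Cap each framework's thread pools, e.g. when several models share a host.
RuntimeOptions make_constrained_options()
{
    RuntimeOptions opts;
    opts.experimental_inter_op_parallelism_threads = 1;
    opts.experimental_intra_op_parallelism_threads = 4;
    // The real call would look something like:
    //   Neuropod model(path, opts);
    return opts;
}
```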
Test Plan: